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Abstract 

D ■ In 1930, Godel j7] presented in Konigsberg his famous Incompleteness Theorem, stat- 

ing that some true mathematical statements are unprovable. Yet, this result gives us no 
idea about those independent (that is, true and unprovable) statements, about their fre- 
C*~) | quency, the reason they are unprovable, and so on. Calude and Jiirgensen [4] proved in 2005 

Chaitin's "heuristic principle" for an appropriate measure: the theorems of a finitely- specified 
theory cannot be significantly more complex than the theory itself (see |H|). In this work, 
we investigate the existence of other measures, different from the original one, which satisfy 
this "heuristic principle". At this end, we introduce the definition of acceptable complexity 
• measure of theorems. 
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O 

H-l 



1 Introduction 

> . 

In 1931, Godel [7] presented in Konigsberg his famous (first) Incompleteness Theorem, stat- 
ing that some true mathematical statements are unprovable. More formally and in modern 
, terms, it states the following: 

Every computably enumerable, consistent axiomatic system containing elemen- 
tary arithmetic is incomplete, that is, there exist true sentences unprovable by 
0^ . the system. 

o ■ 

The truth is here defined by the standard model of the theory we consider. Yet, this result 
gives us no idea about those independent (that is, true and unprovable) statements, about 
their frequency, the reason they are unprovable, and so on. Those questions of quantitative 
results about the independent statements have been investigated by Chaitin [5] in a first 
time, and then by Calude, Jiirgensen and Zimand [2 J and Calude and Jiirgensen [4]. A state 
of the art is given in [3|. Those results state that in both topological and probabilistic terms, 
incompleteness is a widespread phenomenon. Indeed, unprovability appears as the norm for 
true statements while provability appears to be rare. This interesting result brings two more 
questions. Which true statements are provable, and why are they provable when other ones 
are unprovable? 

Chaitin |5j proposed an "heuristic principle" to answer the second question: the theorems 
of a finitely- specified theory cannot be significantly more complex than the theory itself. It 
was proven [4] that Chaitin's "heuristic principle" is valid for an appropriate measure. This 
measure is based on the program-size complexity: The complexity H(s) of a binary string 
s is the length of the shortest program for a self-delimiting Turing machine (to be defined 
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in the next section) to calculate s (see O [6l [H [9]). We consider the following computable 
variation of the program-size complexity: 

S(x) = H(x) - \x\ . 

This measure gives us some indications about the reasons of unprovability of certain 
statements. It would be very interesting to have other results in order to understand the 
Incompleteness Theorem. Among them, one can try to prove a kind of reverse of the theorem 
Calude and Jiirgensen proved. Their theorem states that there exists a constant N such that 
any theory which satisfies the hypothesis of Godel's Theorem cannot prove any statements 
x with S(x) > N. Another question of interest could be the following: Does there exist any 
independent statements with a low 5-complexity? 

Those results are only examples of what can be investigated in this domain. Yet, such 
results seem to be hard to prove with the (5-complexity. The aim of our work is to find 
other complexities which satisfy this "heuristic principle" in order to be able to prove the 
remaining results. At this end, we introduce the notion of acceptable complexity measure 
of theorems which captures the important properties of 6. After studying the results of [I] 
about 5, we define the acceptable complexity measures. We study their properties, and try 
to find some other acceptable complexity measures, different from S. 

The paper is organized as follows. We begin in Section [2] by some notations and useful 
definitions. In Section 03 we present the results of [4] with some corrections. Section 0] 
is devoted to the definition of the acceptable complexity measure of theorems, and some 
counter-examples will be given in Section Q3 This section is also devoted to the proof of the 
independence of the conditions we impose on a complexity to be acceptable. In Section [6l 
we will be interested in the possible forms of those acceptable complexity measures. 

2 Prerequisites and notations 

In the sequel, N and Q respectively denote the sets of natural integers and rational numbers. 
For an integer i > 2, logj is the base i logarithm. We use the notations [a\ and \a] 
respectively for the floor and the ceiling of a real a. The cardinality of a set S is denoted by 
card(iS). For every integer i > 2, we fix an alphabet ATj with i elements, X* being the set of 
finite strings on JQ, including the empty string A, and \uu\j the length of the string w E Xi. 

We assume the reader is familiar with Turing machines processing strings [13] and with 
the basic notions of computability theory (see, for example |12 [ [TT | [T0]). We recall that a set 
is said computably enumerable (abbreviated c.e.) if it is the domain of a Turing machine, 
or equivalently if it can be algorithmically listed. 

The complexity measures we study are computable variation of the program-size com- 
plexity. In order to define it, we define the self-delimiting Turing machines, shortly 
machines, which are Turing machines the domain of which is a prefix- free set. A set 
S C X* is said prefix-free if no string of S is a proper extension of another one. In other 
words, if x, y E S and if there exists z such that y = xz, then z — X. We denote by 
PROGt — {x E X* : T halts on x} the program set of the Turing machine T. We recall two 
important results on prefix-free sets. If S C X* is a prefix-free set, then Kraft's Inequality 
holds: J2kLi r k ' i~ h < 1; where = {x G S : \x\ i — k}. The second result is called the 
Kraft-Chaitin Theorem and states the following: Let (nfc)fcgN be a computable sequence of 
non-negative integers such that 

oo 
k=l 

then we can effectively construct a prefix-free sequence of strings (wk)k&t such that for each 
k > 1, \wk\i = n k . 

The program-size complexity of a string x G Xq, relative to the machine T, is defined by 
Bi.T = min {\y\. : y G X* and T(y) = x} . 
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In this definition, we assume that min(0) = oo. The Invariance Theorem ensures the 
effective existence of a so-called universal machine Ui which minimize the program-size 
complexity of the strings. For every T, there exists a constant c > such that for all 
x G X*, Hi^jjiix) < Hi^T(x) + c. In the sequel, we will fix Ui and denote by Hi the 
complexity H^jj i relative to Ui. 

A Godel numbering for a formal language L C X* is a computable, one-to-one function 
g : L — > X$. By Gi, or G if there is no possible confusion, we denote the set of all 
the Godel numbering for a fixed language. In what follows, we consider theories which 
satisfy the hypothesis of Godel Incompleteness Theorem, that is finitely-specified, sound 
and consistent theories strong enough to formalize arithmetic. The first condition means 
that the set of axioms of the theory is c.e.; soundness is the property that the theory only 
proves true sentences; consistency states that the theory is free of contradictions. We will 
generally denote by T such a theory, and by T the set of theorems that T proves. 



3 The function 5 g 

We present in this section the function S g and some results about it. It was defined in [4] 
and almost all the results come from this paper. Hence, complete proofs of the results can 
be found in it. Yet, there was a mistake in the paper, and we need to modify a bit the 
definition of S g . We have to adapt the proofs with the new definition. The transformations 
are essentially cosmetic in almost all the proofs so we give only sketches of them. For 
Theorem 13, 2} there are a bit more than details to change, so we provide a complete proof of 
this result. Furthermore, we formally prove an assertion used in the proof of Theorem 13.51 
We first define, for every integer i > 2, the function Si by 

5i(x) = Hi{x) ~ \x\ t . 

Now, in order to ensure that the complexity we study is not dependent on the way we write 
the theorems, we define the ^-complexity induced by a Godel numbering g bjQ 

6 g (x) = H 2 (g(x)) - \log 2 ({) ■ |41 , 

where g is a Godel numbering the domain of which is in X* . 

The first result comes in fact from [I], and the theorem we present here is one of its 
direct corollaries. 

Theorem 3.1 ([4, Corollary 4.3]). For every t > 0, the set {x G X* : 8i(x) < t} is infinite. 

Proof. Following [TJ Theorem 5.31], for every t > 0, the set C ht = {x G X* : 5i(x) > -t} is 
immun^|. Hence, as Complex, t = {x G X* : 6i(x) > t} is an infinite subset of an immune 
set, it is immune itself. The set in the statement being the complement of the immune set 
Complex^ t , it is not computable, and in particular infinite. □ 

The next theorem states that the definitions via a Godel numbering or without this device 
are not far from each other. It allows us to work with the function Si instead of S g and thus 
to simplify the proofs thanks to the elimination of some technical details. Nevertheless, 
those details are present in the following proof. 

Theorem 3.2 ([2 Theorem 4.4]). Let A C X* be c.e. and g : A -> B* be a Godel 
numbering. Then, there effectively exists a constant c (depending upon Ui,U 2 , and g) such 
that for all u G A we have 

|fl- a (ff(u))-log 2 (i)-Hi(u)| <c (3.1) 

lr Tlie definition in [3] was S g (x) = H2(g(x)) — [log 2 i] ■ \x\ { . 

2 A set is said immune when it is infinite and contains no infinite c.e. subset. 
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Proof. We will in fact prove the existence of two constants c\ and c 2 such that on one hand 



H 2 (g(u)) < log 2 (i) • Hi(u) + ci (3.2) 

and on the other hand 

log 2 (») • Hi(u) < H 2 {g(u)) + 02. (3.3) 

For each string w 6 PROGu t , we define n w = |~log 2 (i) • This integers verify the 

following: 

j2 2-«»= J2 2-r io ^«>i*i < *~h*<i, 

wePROGui wePROG Ui wEPROGv ( 

because PROGjji is prefix-free. This inequality shows that the sequence (n w ) satisfies the 
conditions of the Kraft-Chaitin Theorem. Consequently, we can construct, for every w € 
PROGui , a binary string s w of length n w and such that the set {s w : w G PROG^ } is c.e. 
and prefix-free. Accordingly, we can construct a machine M whose domain is this set, and 
such that for every w 6 PROGjji, 

M(s w ) = g(Ui(w)). 

If we denote, for a string x £ X* , x* the lexicographically first string of length Hi{x) such 
that Ui(x*) = x, we now have M(s w *) = g(Ui(w*)) — g(w), and hence 

H M (g(w)) < \s w ,\ 2 = riog 2 (i) • Klil 

= flog 2 (i) • Hi(w)] < log 2 (i) • ffiH + 1. 

By the Invariance Theorem, we get the constant c\ such that l|3.2p holds true. 

We now prove the existence of c 2 such that Q3.3P holds true. The proof is quite similar. 
For each string w € PROGu 2 , we define m w — [log i (2) • |w| 2 l- As for the n w , the integers 
m w satisfy 

w£PROGu 2 wePROGu 2 

We can also apply the Kraft-Chaitin Theorem to effectively construct, for every w € 
PROGu 2 , a string t w £ X* of length m w and such that the set {t w : w € PROGu 2 } is 
c.e. and prefix-free. As g is a Godel numbering and hence one-to-one, we can construct 
a machine D whose domain is the previous set and such that D{t w ) = u if J7 2 (w) = g(u). 
Now, if U-i{w) = g{u), then 

H D (u) < rio gi (2) • H a l < lo gi (2) • \w\ 2 + 1 

< \o gl (2) ■ H 2 (g(u)) + d. 

So we apply the Invariance Theorem to get a constant d! such that log 2 (i) ■ Hi(u) < log 2 (i) • 
Hd{u) + d', hence 

\og 2 (i)-Hi{u)<H 2 {g{u))+d + d'. 
The constant c 2 = d + d! satisfies (13-311 . □ 



In [4], the equation l|3.ip was \S g (u) — [log 2 i] • Si(u)\ < d. Theorem 13.21 gives a similar 
result for 5, hence \S g (u) — log 2 (i) • 5i(u)\ < c + 1, where c is the constant of the theorem. 
In the proof, we supposed that A — X* but it is still valid with a proper subset of X* . 

The next corollary will be important for the generalization of 6 g we will do in the next 
section. It is the same kind of result as above, but applied to two Godel numberings. 

Corollary 3.3 ([U Corollary 4.5]). Let A C X* be c.e. and g,g' : A — ► B* be two Godel 
numberings. Then, there effectively exists a constant c (depending upon U 2 ,g and g' ) such 
that for all u 6 A we have: 

\H 2 {g(u))-H 2 (g'(u))\<c. (3.4) 
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In order to have a complete formal proof of Theorem l3.5t we need to bound the complexity 
of the set T of theorems that a theory T proves. It is the aim of the following lemma. 

Lemma 3.4. Let T be a finitely-specified, arithmetically sound (i.e. each arithmetical 
proven sentence is true), consistent theory strong enough to formalize arithmetic, and denote 
by T its set of theorems written in the alphabet Xj. Then for every x G T, 

~ • 14 + < Hi(x) < 14 + 0(1). 

Proof. For the upper bound, it is sufficient to give a way to describe those theorems using 
descriptions not greater than their lengths, and which ensure that the computer we use is 
self-delimiting. We first note that a theorem in T is a special well-formed formula. The 
bound we give is valid for the set of all the well-formed formulae. We consider the following 
program C: on its input x, C tests if a; is a well-formed formula. It outputs it if the case 
arises, and enters in an infinite loop else. 

This program has to be modified a bit as its domain is not prefix-free. The idea here is 
to add at the end of the input a marker which appears only at the end of the words. In that 
way, if x is prefix of y, then the end-marker has to appear in y. As it can only appear at the 
end of y, then x — y. It ensures that the domain is prefix-free. We now have to define an 
end-marker. It is sufficient to take an ill-formed formula. More precisely, we need a formula 
y such that for every well-formed formula x, xy is ill-formed, and for every z S X* , xyz is 
also ill-formed. For instance, we can take y = ++, where the symbol + is interpreted as 
the addition of natural numbers. There are in all formal systems plenty of possibilities for 
this y (another choice could be (+ for instance, or any ill-formed formula with parenthesis 
around). In the sequel, y represents a fixed such ill-formula. 

The new machine C works as follows: on an input z, C checks if z = xy with a certain 
x. If the case arises, it checks if x is a well-formed formula, and then outputs x if it does. In 
all the other cases, C diverges. Now, we have a new machine C whose domain is prefix-free, 
and such that Hc{x) < \x\ i + \y\ v By the Invariance Theorem, we get a constant c such 
that Hi(x) < 14 + c. 

We now prove the lower bound, that is that the complexity of a theorem has to be 
greater than a half of its length, up to a constant. The idea is the following: If we consider 
a sentence x of the set of theorems T, then it may contain some variables which cannot 
be compressed. More precisely, as we can work with many variables, it is not possible that 
for each of these variable, the word which is used to represent it has a small complexity. 
To formalize the idea, we have to define in a formal way what the variables in our formal 
language are. We consider that the variables are created as follows. A variable is denoted by 
a special character, say v, indicating that it is a variable, and then a binary-written number 
identifying each variable. This number is called the identifier of the variable. In the sequel, 
we denote by v n the variable the identifier of which is the integer n. 

Now, we have to consider the formulae defined by 

ip(m,n) = 3v m 3v n (v m = v n ). 

We suppose that m and n are random strings, that is i2,(m) > \m\ i + 0(1) and Hi{n) > 
\n\ { + 0(1). Furthermore, we suppose that H(m,n) > \m\ i + \n\ i + 0(1), in other words 
that m and n together are random. We can suppose that as such words do exist. Then 

Hi(<p(m,n)) > Hi(m) + Hi(n) + <D(1) 

> \m\ i + \n\ i + 0{l) 

> \-Mm,n)\ l + 0{l). 

Thus, we obtained the lower bound. 

□ 

Improving the bounds in this lemma seems to be hard. A preliminary work should be 
to define exactly what we accept as a formal language. 
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The next theorem is the formal version of Chaitin's "heuristic principle". The very 
substance of the proof comes from previous results. 

Theorem 3.5 ([H Theorem 4.6]). Consider a finitely-specified, arithmetically sound (i.e. 
each arithmetical proven sentence is true), consistent theory strong enough to formalize 
arithmetic, and denote by T its set of theorems written in the alphabet X{ . Let g be a Godel 
numbering for T . Then, there exists a constant N , which depends upon Ui, U2 and T, such 
that T contains no x with 5 g {x) > N . 

Proof. By Lemma 13, 4} for every x G T, Si (x) < c. Using Theorem 13.21 there exists a 
constant N such that for every x S T, S g (x) < N. □ 

The S g measure is also useful to prove a probabilistic result about independent state- 
ments. Indeed, we can prove that the probability of a true statement of length n to be 
provable tends to zero when n tends to infinity. 

Proposition 3.6 ([4, Proposition 5.1]). Let N > be a fixed integer, T C X* be c.e. and 
g : T — > B* be a Godel numbering. Then, 

lim r n ■ card{x e X* : \xL = n, S g (x) < N} = 0. (3.5) 

n — >oo 

We do not give a proof of this proposition because it is essentially technical. It can 
be found in [3]. In Section the proof of Proposition 15.61 uses the same arguments and 
differs from this one only by details. Now, we can express the probabilistic result about 
independent statements. The proof of this result can be found in [H p. 11]. 

Theorem 3.7 (|4j Theorem 5.2]). Consider a consistent, sound, finitely- specified theory 
strong enough to formalize arithmetic. The probability that a true sentence of length n is 
provable in the theory tends to zero when n tends to infinity. 

4 Acceptable complexity measures 

The function S g is our model to build the notion of acceptable complexity measure of theorems. 
At this end, we first define what a builder is, and then the properties it has to verify in 
order to be said acceptable. An acceptable complexity measure of theorems will then be a 
complexity measure built via an acceptable builder. 

Definition 4.1. For a computable function pi : N x N — > Q, we define the complexity 
measure builder p by 

p:G - [X*^Q] 

g 1 > [u i-v pi(H 2 (g{u)), \u\i)] 

The function pi is called the witness of the builder. In the sequel, we note p g (u) instead of 

Now, we define three properties that a builder has to verify to be acceptable. We recall 
that T denotes a theory which satisfy the hypothesis of Godel Incompleteness Theorem, 
and T its set of theorems. 

Definition 4.2. A builder p is said acceptable if for every g, the measure p g verifies the 
three following conditions: 

(i) For every theory T , there exists an integer Njr such that if J- h x, then p g (x) < Njr. 

(ii) For every integer N, 

lim i~ n ■ card-fx G X* : \x\ i = n and p g (x) < N} = 0. 
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(Hi) For every Godel numbering g' , there exists a constant c such that for every string 
ueX*, \p g {u) - p g >{u)\ < c. 

The first property is simply the formal version of Chaitin's "heuristic principle". The sec- 
ond one corresponds to Proposition 13.61 and eliminate trivial measures. Finally, (00) ensures 
the independence on the way the theorems are written. In other words, the properties (0), 
(0]) and ijnrjl ensure that an acceptable complexity measure satisfy Theorem [331 Proposition 
13.61 and Corollary 13.31 respectively. 

The following proposition will be useful in the sequel. It is a weaker version of the 
property (0) which is used to prove that a measure is not acceptable, and more precisely 
that it does not satisfy this first property. 

Proposition 4.3. Let p g be an acceptable complexity measure. Then there exists an integer 
N such that for every integer M > N , the set 

{x 6 X* : p g (x) < M] (4.1) 

is infinite. 

Proof. We consider a theory T and the integer Njr given by the property (0 in Definition 
14.21 Clearly, T can prove an infinity of theorems, such as "n = n" for all integer n. All of 
them have by property (0 a complexity bounded by Njr. If T is the set of theorem that T 
proves, then 

T C {xeX* : Pg {x) <N r }. 
As T is infinite, so is the set in the proposition, and it remains true for every M > Njr. □ 

We now prove that the ^-complexity is an acceptable complexity measure. This result 
is natural as the notion of acceptable complexity measure was built to generalize 5 g . 

Proposition 4.4. The function S g is an acceptable complexity measure. 

Proof. The S g function we defined plays the role of p g . We have to provide an acceptable 
builder. Let define 

6i(x,y) = x- [log 2 (i) • y| 

which plays the role of pi. Then S g (x) = Si(H2(g(xj), \x\ t ). 

In fact, the properties of 5 g proved in [4| are exactly what we need here. One can easily 
check that (0) is ensured by Theorem 13. 5} ([nf by Proposition 13.61 and (jm| by Corollary 
EH1 □ 

The goal of defining an acceptable builder and an acceptable measure is to study other 
complexities than S g . The following example proves that the program-size complexity is not 
acceptable. This result, even though it is plain, is very important. Indeed, it justifies the 
need to define other complexity measures. 

Example 4.5. A first natural complexity to study is the program-size complexity. There 
is no difficulty in verifying that H is a complexity measure. Formally, we have to define 
pi(x,y) — x and such that H2(g{x)) — Pi(x, ja:^). We study the properties of the builder 
H2{g{x))\. Let us see how it behaves with the three properties of Definition 14.21 

(i) This first property cannot be verified. Indeed, we note that 

card {a; e X* : H 2 {g{x)) < N} 

< card{y G X£ : H 2 {y) < N} 

< 2 N . 

If the property was verified, the set of theorems T proved by T would be bounded by 
2 N , a contradiction. 
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(ii) This property is on the contrary obviously verified. Indeed, as 

card {a; G X* : H 2 {g(x)) < N} < 2 N , {x £ X* : \x\ t = n and H 2 {g{x)) < N} = for 
large enough n. 

(Hi) This property corresponds exactly to Corollary 13, 3} and is verified. 

As the program-size complexity cannot be used there, we try to find other complexities 
which better reflect the intrinsic complexity. That is why we use the length of the strings 
to alter the complexity. It seems natural that the longest strings are also the most difficult 
to describ^E In the next section, we will give two other examples of builder which are not 
acceptable. 

5 Independence of the three conditions 

The aim of this section is to prove that the conditions (Q) , ijn]) and ([m| in Definition 14.21 
are independent from each other. At this end, we give two new examples of unacceptable 
builders. Each of those unacceptable builders exactly satisfy two conditions in Definition 
14.21 Furthermore, they give us a first idea of the ingredients needed to build an acceptable 
complexity builder. In particular they show us that a builder shall neither be too small nor 
too big. 

Example 5.1. Let p\ be the function defined by pj(x,y) = x/y if y * and else. It 
defines a builder p 1 and for every Godel numbering g, we can define Pg by 



p 1 M 



'^g^ 1 , if** A, 
0, else. 



We will see in the sequel that p 1 is a too small complexity. In fact, it is even bounded. 
In order to avoid this problem, we define p 2 by dividing the program-size complexity by the 
logarithm of the length. 



Example 5.2. We consider pf defined by 



if y > 1, 



10, else. 

The corresponding builder applied with a Godel numbering g defines the function 

[0, else. 

In order to make the proofs easier, we introduce a new function for each already defined 
builders. Those functions make no use of Godel numberings. They are the equivalents of Si 
for p 1 and p 2 . They can help us in the proofs because we prove first that they are up to a 
constant equal to the complexity measures. For p , we define p\ be by p\(x) — Hi(x)/ \x\ i 
if x ^ A and else. And similarly, for p 2 , we define p 2 (x) = Hi(x)/ [logi if \x\ i > 1 and 
else. 

Lemma 5.3. Let A C X* be c.e. and g : A — > B* be a Godel numbering. Then, there 
effectively exists a constant c (depending upon Ui, U 2 and g) such that for all u € A, we 
have 

4(u) -log 2 (i) ■ f4(u)\ <c, (5.1) 

J = 1,2. 



3 One has to be very careful with this statement which is not really true. 
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Proof. We first note that this difference is null for u = A in the case j = 1, and for \u\ i < 1 
in the case j — 2. In the sequel, we suppose that \u\ { > (for j = 1) or \u\ i > 1 (for j = 2). 
Theorem 13 . 21 states that 

|ff 2 ( ff ( U ))-log 2 (i)-ffiH| < c . 

We now just have to divide the whole inequality by \u\ i > 1 to obtain l|5.ip with j = 1 and 
by [logj \u\^\ which is not less than one but for finitely many u to obtain the result with 
j = 2. ' □ 

This result allows us to work with much easier forms of the complexity functions. We 
now study the properties that p l g and p. satisfy. As a corollary of the above lemma, we can 
note that both of the measures satisfy (fur]) . 

Proposition 5.4. The function p g verifies condition (0) in Definition \4-2\ but does not 
verify (fit]). 

Lemma 5.5. There exists a constant M such that for all x £ X* , pi(x) < M. 

Proof. The result is plain for x — X. We now suppose that \x\ i > 0. In view of [U Theorem 
3.22], there exist two constants a and (3 such that for all x 6 X*, 

Hi(x) < \x\ i +a-\og i \x\ i +(3, 

so, for x ^ A, 

As logj(|x|j)/ la;^ < 1 for every x ^ A, then 

p\{x) < 1 + a + p. 

Furthermore, Lemma [531 states that for every x, we have 

p\{x) < c + \og 2 {i)- p\(x) 

< c + log 2 (i)-(l + a + /3). 

Accordingly, M = \c + log 2 (i) • (1 + a + /3)] satisfies the statement of the lemma. □ 

Proof of Proposition \5.4\ The property (P is obvious since Lemma 15.51 tells us that the 
bound is valid for every sentence x, not only provable ones. On the contrary, the fact that 
p g is bounded by M implies that for N > M, the set {x £ X* : \x\ i —n and p\{x) < N} is 
the set X™. Hence the limit of |(n} is 1 instead of 0. □ 

The above proof shows us that an acceptable complexity measure cannot be too small (p 1 
is even bounded). We will now see, thanks to the complexity measure p 2 , that an acceptable 
complexity measure cannot be too big either. 

Proposition 5.6. The function p 2 g verifies condition {mP in Definition \4.H\ but does not 
verify @). 



Proof. We begin with the proof of |n| for p 2 . Theorem 15 . 31 allows us to consider p 2 instead of 
p 2 , with a new constant \(N + c)/log 2 (i)] . Indeed, it states that p 2 (x) > log 2 (i) • p 2 (x) — c, 
and consequently 

{ x £ X? : p 2 g (x) < N } C | x £ X? : p 2 < } ■ 

In order to avoid too many notations, we still denote this constant by N. 
First, we note that 

{x £ X? : p 2 (x) <N} = {x£ X? : 3y £ X f N ^ n \ U t (y) = x} . 
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Translating in terms of cardinals, we obtain 

cardjx £ XT : pf{x) < N} 

< card [x €X?: Bye xf N ' r ' og * n] , U^y) = x) 

< card {yeXf N -^ n] : \U l {y)\=n) 

< card {y e xf N ' n ° Si nl : £/,(?/) halts.} 

N- [log; n] 

< card {y 6 X fe : u i(y) halts.} 



fe=i "~ 



We extend these inequalities to the limit when n tends to infinity: 
lim r" • card {x € X™ : p 2 g (x) < N} 



n — >oo 

N- [log; n] 

'7, 

fe=l 

TV- [log; n] 



< lim V r™ ■ 

n — >-oc ' ^ 



n — * no fa ^ 



We note that 



jV' [log 4 n] m 

lim V i-N-\log,n] >rfc = nm V- r m. rfc< 

n— 'oo ^ — ' m— »oc ^ — ' 

fc=l fe=l 

Now, 

m+1 m 

lim fc-1 — ^-^ — = - ■ lim i~ m ■ r m = 0. 

m — >oo ^m-j-l — ^ — ^ m — >oo 

The last inequality comes from Kraft's inequality: 

]T*- m -r m < 1. 

m— 1 

So we can apply Stolz-Cesaro Theorem to ensure that 



TV - [log; n] 

lim V r w ^ los * n "l T fc =0. (5.2) 

I. — >rxn < * 



n— >oc 

fc=l 



On the other hand, 

lim jtf-P«*i«l-» = 0. (5.3) 



We just have to combine l|5.2p and (|5.3|1 to obtain (|h|). 

Now, it remains to prove that ([!} is not verified. At this end, we suppose that (0) holds. 
We note T the set of theorems that T proves. Note first that 

card{x e X* : \x\. = n and H 2 (g{x)) < N ■ [log,,™]} (5.4) 
< card {y e B* : H 2 (y) < N ■ [log, n] } 

< 2 /V. n /V.lo gl 2^ (55) 
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So, if ((il holds for all x G T, we have 

card {x G T : |x| = n} < cm^, (5.6) 

for every integer n, where a and j3 come from l|5.5p . 
But we now consider the set of formulae 

$fc = |qo^oQi^i ■ ■ - QkXk f\(xi = xi) : Qi G {V, 3} j . 

Each formula <p G $fc is true, and all formulae have the same length n% — O(k). Furthermore, 
card = 2 fe . 

As all those formulae belong to the predicate logic, all of them are provable in J 7 , that 
is to say they belong to T. As we can take k as big as wanted, we can also have as big 
as wanted. 

Now we have, for arbitrary large n, 2°^ formulae of length n which belong to T. That 
contradicts Q5.6J1 . and so, (0) is false. □ 

We can now prove that (0, (0]) and (00) in Definition E21 are independent from each other. 
As we know, with S g , that there exists an acceptable complexity builder, it is sufficient to 
prove that for each of the three conditions, there exists a builder which does not satisfy it 
while it satisfies both other ones. 

Theorem 5.7. Each condition in Definition \4-2\ is independent from others. 

Proof. The measure builder p 1 is an measure example which satisfies both (0) and lfm|) but 
not (EU) while p 2 does not satisfy 10) but (JTTJ) and (03) ■ To prove the complete independence 
of the three conditions, it remains to prove that a complexity measure builder can satisfy 
both |0) and (0} without satisfying (00). 

In fact, our proof here does not exactly follow the scheme we gave. It is still unknown if 
all the complexity measure builders satisfy (00) , or if there exist some of them not satisfying 
it. Thus, the proof is built as follows. We prove that either all complexity builders satisfy 
(00) , or there exists at least one complexity builder satisfying (0 and (0]) without satisfying 
(01} ■ We also give the exact question the answer of which would make the choice between 
the both possibilities. 

Let g and g' be two Godel numberings from X* to X|, and p g and p g > two complexity 
measures built with the same builder. The question is to know if H 2 (g(x)) = H2{g'{x)) 
for all but finitely many x G X* or if there exists an infinite sequence (x„)„ s n such that 
H2(g(x n )) 7^ H2{g'{x n )) for all n. Suppose that the first case holds, then for all but finitely 
many x G X* , p g (x) = pi(H 2 (g(x)), {x^) = pi(H 2 {g'(x)), |aj|J = p g >(x). Consequently 

c = max { | # 2 (.9 (V)) - H 2 {g'(x))\ : x G X*} < oo, 

and the builder p satisfy (00). 

We suppose now that the second case holds, that means that there exist infinitely many 
strings x G X* such that H%{g{x)) ^ H2{g'(x)). We consider the acceptable complexity 
measure 5 g . We define the measure p g by x \— > 8 g (x) 2 . More formally, if we denote by 5i the 
witness of the builder 5, we define the builder p via the witness pi = 5f. Let us consider the 
behaviour of this function with the three properties: 

(i) As S g is acceptable, there exists Njr such that if T h x, then 6 g (x) < Njr. Then it is 
plain that p g (x) < Njr 2 . So (i) is verified. 

(ii) For an integer N > 1, if p g {x) < N, then S g (x) < N too. So we have the following: 

{x G X*i : \x\i = n and p g (x) < N} 
C {x G X* : \x\t = n and 5 g {x) < N} . 



11 



Consequently, 

lim i~ n ■ card{x £ X*i : \x\ 4 — n and p q (x) < N} 

n— >oo 

< lim i~ n ■ card {x 6 X* : Id. = n and S„(x) < N} = 0. 

So (ii) is also verified, 
(mj We first note that 

Psfa) -Pg'( x ) 
= 5 g (xf-S g ,(xf 
= (if 2 (.g(x)) - riog 2 (i) • |x|J) 2 

= (H 2 {g{x)f-H 2 {g'{x)f) 

-2 • riog 2 (i) • |41 (H 2 (g(x))-H 2 (g'(x))). 

We know from Corollary 13.31 that (H 2 (g(x)) — H 2 {g'{x))) is bounded. Thus, we only 
need to prove that \H 2 (g(x)) 2 — H 2 (g'(x)) 2 \ is unbounded, and we will be able to 
conclude that ljm|) is not satisfied by p. Suppose that it is bounded by an integer N. 
As we have supposed that there exist infinitely many x £ X* such that H 2 (g(x)) ^ 
H 2 (g'(x)), then there exists for every integer M a string x such that H 2 {g(x)) > 
H 2 {g (x)) > iW0. Then 

H 2 (g{x)f-H 2 (g'{x)f 
= (H 2 (g(x)) - H 2 {g\x))) ■ (H 2 (g(x)) + H 2 (g'(x))) 
> 1 • (2 • M) = 2M. 

We can also conclude, using an integer M > N/2 that this bound cannot exist, that 
is ((TTTJ) is not satisfied. 

□ 



6 Form of the acceptable complexity measures 

The aim of this section is to give some conditions that a complexity measure has to verify 
to be acceptable. More precisely, we will study some conditions a builder, and in particular 
its witness, has to verify such that the complexity measures it builds are acceptable ones. 
We restrict our study to particular witnesses, such as linear functions in both variables, or 
functions defined by 

where / is a computable function. 

Our first result shows a kind of stability of the acceptable complexity measures. Fur- 
thermore, it makes the following proofs easier. 

Proposition 6.1. Let p g be an acceptable complexity measure, and a, (3 € Q such that 
a > 0. Then a • p g + (3 is also an acceptable complexity measure. 



Proof. Property in Definition 14.21 remains true with a new constant a ■ N + (3 instead of 
N . In the same way, 

{x e X* : \x\ % = n and a ■ p g (x) + j3 < N} 

N - (3 



C IxE X* : \x\j — n and p g (x) < 



a 



4 We can impose here without any loss of generality that H2(g(x)) > H2(g'(x)) because the converse situation 
would be equivalent. 
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hence Property (JuJ) is verified. Now, if we consider two Godel numberings g and g', 



\(a ■ p g {x) + (3) - (a- Pg'(x) + (3)\ = a ■ \p g (x) - p g <(x)\ < a ■ c, 

which proves that Property lpH|) is retained. □ 

We start studying the linear in both variables witnesses. The result we obtain is partial. 
However, as discussed after Lemma \3A\ this result is not likely to be improved without a 
complete study of the definition of the formal languages. 

Proposition 6.2. Let f be a function of two variables, linear in both variables such that 
pi defined by Pi(x) = [f(x)\ is computable. If pi defines an acceptable complexity measure, 
then there exist a,b and e, a > and 1/2 < e < 1, such that 

Pi(x, y) = [a ■ (x - e • log 2 (i) -y) + b\. 

Proof. We consider any function which satisfies the hypothesis. Then there exist a, (3 and 
7 such that 

Pi{ x > v) = l ax -0y + i x v\ ■ 

Proposition 16.11 allows us to fix Pi{0, 0) = 0. Of course, it would be equivalent to consider 
ax + /3y + jxy, but the chosen version simplifies the notations. Let /?' be such that (3 = 
(3' -log 2 (i). The proof is done in several steps. We start by showing that one at least of a and 
7 has to be different from zero, then that 7 = 0. After that, we prove that a/2 < (3' < a. 

Suppose that a = 7 = 0. Then p g {x) — — \(3\x\^\. If f3 < 0, then Proposition 14.31 is not 
verified by our complexity measure, and hence neither is Property If (3 > 0, it is obvious 
that Property ([n} cannot hold true. 

Then, we use the property and consider the set 

{x S X* : \x\ t = n and p g (x) < N} 

(3n + N +1 
jn + a 



C {xeX* : \x\i = n and H 2 (g(x)) < 

Furthermore, 



Bn + N + 1 \ if 7^0; 

lim ———————— — — — — — = < (N + l)/a, if 7 = = 0; 



jn + a 



±00, if 7 = and f3 ^ 0. 



The only solution is the third one because in order to satisfy <(TJ) , this limit has to be infinite. 
Indeed, if it is finite, we can use the same proof as in Proposition 15.61 to conclude to a 
contradiction. So we know that 7 = 0, and hence that a 7^ 0. We can right now say that a 
and (3 have the same sign, because the limit cannot be —00. Using Proposition 16. 11 we can 
assume that a = 1. Indeed, a < is not possible because of Property (jirj). 

To make easier the remaining of the proof, we define an auxiliary measure as we did in 
Sections [3] and [5] for 5, p 1 and p 2 . Let pi be defined by 

p i (x)=[H i (x)-/3' -\x\.\. 

Applying Theorem 13.21 we get a constant c such that for every x, 

\p g {x) - log 2 (i) • pi{x)\ < c. 

We will now use the property (Jn]) to have other information on f3' , and hence (3. We only 
know at that stage that (3' > 0. We consider the set 

{x e X* : \x\ i = n and p g (x) < N} 
C {x £ X* : \x\ t = n and H,(x) < f3' ■ n + N + c + 1} . 
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If /?' > 1, then for every constant d, if we choose n large enough we have 0' ■ n > n + d- logn. 
And we can use the inequality Hi{x) < |x| i + C(log i |x|j) (see [H Theorem 3.22]) to conclude 
that the above set is A™. And so, property (p} is not verified, the limit being 1. 

Using now the lower bound in Lemma [33 we know that for every proven sentence x, 

Suppose that f3' < 1/2. Then for every x such that T h x, 



Pi 0*0 



Thus, |0) cannot be verified. □ 



We study another kind of witnesses. Functions defined by 

x 



Pi(x,y) 



where / is a computable function may be interesting because they are the only reasonable 
candidates for being witness of multiplicative complexity measures. Indeed, a complexity of 
the form H 2 (g{x)) ■ \x\ t has no chance to satisfy the desired properties. Unfortunately, such 
functions never define acceptable measures. 

Proposition 6.3. Let f be a computable function, and pi defined by 



x 



Then the complexity measure builder the witness of which is pi cannot satisfy at the same 
time properties f|) and Im)). 

Proof. Suppose that p g {x) = pi(H 2 (g(x)), \x\^) satisfy (0). Then consider the set 

{x G X* : \x\i = n and H 2 {g{x)) < N ■ f(n)} . 

Its cardinal is at most 2 N '^ n K Furthermore, this set contains the set of all the sentences in 
T the length of which is n. Hence, 

card{a; £ T : \x\ i = n} < 2 N -^ n \ (6.1) 



Now, we give a lower bound to this cardinal. The proof of Proposition 15.61 shows that 
this cardinal is greater to 2°w. Accordingly, there exists a constant c such that 

cardie T: \x\ . = n} > 2 c ' n . (6.2) 
We also obtain that 2 C ™ < 2 N ' fi - n \ We can conclude that 

f(n) > £ ■ n. (6.3) 

We now follow the proof we made to show that Pg does not satisfy ((n} . We can define 

Hj(x) 

and we prove as for p 1 and p 2 that there exists a constant d such that 

\p g (x) -log 2 (i) • pi(x)\ < d. 



The proof of Lemma T5.3I is still valid here. In the same way, we extend Lemma [5.51 to p g , 
namely there exists a constant M such that p g is bounded by M. Considering p g instead of 
p g has just an influence on the value of the constant M. 

Now, we have to note that for N > M, the set {x £ X* : \x\ i — n and p g {x) < N} is the 
set X" to conclude that property is not verified. □ 
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7 Concluding remarks 



In this paper, we have studied the S g complexity function defined by Calude and Jiirgensen 
[4]. This study has led us to modify a bit the definition of S g in order to correct some 
of the proofs. Then, we have been able to propose a definition of acceptable complexity 
measure of theorem which captures the main properties of 5 g . Studying some complexity 
measures, we have shown that the conditions of acceptability are quite hard to complete. 
Yet, the definition seems to be robust enough to allow some investigations to find other 
natural acceptable complexity measures. 

There remain some open questions. Among them, we can express the following ones: 

• Can we improve the bounds of Lemma 13.41 ? This question could be interesting not 
only to improve Proposition 16.21 but also for itself: How simple are the well-formed 
formulae, and in other words, to what extent can we use their great regularities to 
compress them? Yet, as already discussed, this question needs to be better defined. 
In particular, one has to investigate about the definition of the formal languages. The 
answer seems to be very dependent on the considered language. 

• Do there exist some acceptable complexity measure which are very different from 8 g l 
The idea here is to find some measures with which we go further on the investigations 
about the roots of unprovability. 

• In view of the proof of Theorem I5.7[ if we have two Godel numberings g and g' , does 
the equality H2(g(x)) = H2{g'(x)) hold for all but finitely many x or are those two 
quantities infinitely often different from each other? 

Those few questions are added to the ones Calude and Jiirgensen expressed in [4]. The 
goal of finding new acceptable complexity measures is to have new tools to try to answer 
their questions, as the existence of independent sentences of small complexity. 
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